Analysis of Variance from Multiply Imputed Data Sets

نویسنده

  • Trivellore Raghunathan
چکیده

The analysis of variance is a popular method used in many scientific applications. There are standard software for handling unbalanced data due to missing values in the outcome/dependent variable. The analysis becomes difficult when the missing values are in predictors. Multiple imputation is an increasingly popular method for handling such incomplete data. This approach involves replacing the missing set of values by more than one plausible set of values, preferably generated from their posterior predictive distribution given the observed data. Each plausible set of imputed values when combined with the observed set of values results in a completed data. Each completed data set is analyzed separately and the point estimates and their standard errors are combined to form a single inference. Many analysis of variance models may be formulated as regression models and then apply the standard multiple imputation combining rules. This is often not possible when the design is complex involving repeated measures and/or nested, random or interaction effects. It may be more convenient to directly combine the analysis of variance tables generated from each completed data to test appropriate hypotheses. This paper develops a combining rule for the completed data mean squares. Approximate F -tests are developed and evaluated using the actual and simulated data sets. The method is extended to comparison of regression models using partial F-tests in multiple linear regression analysis or the deviance statistics in fitting regression models using the Generalized Estimating Equations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple imputation and other resampling schemes for imputing missing observations

The problem of imputing missing observations under the linear regression model is considered. It is assumed that observations are missing at random and all the observations on the auxiliary or independent variables are available. Estimates of the regression parameters based on singly and multiply imputed values are given. Jackknife as well as bootstrap estimates of the variance of the singly im...

متن کامل

Order selection tests with multiply imputed data

Nonparametric tests for the null hypothesis that a function has a prescribed form are developed and applied to data sets with missing observations. Omnibus nonparametric tests such as the order selection tests, do not need to specify a particular alternative parametric form, and have power against a large range of alternatives. More specifically, likelihood-based order selection tests are defin...

متن کامل

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

متن کامل

Intent-to-treat analysis for longitudinal studies with drop-outs.

We consider intent-to-treat (IT) analysis of clinical trials involving longitudinal data subject to drop-out. Common methods, such as Last Observation Carried Forward imputation or incomplete-data methods based on models that assume random dropout, have serious drawbacks in the IT setting. We propose a method that involves multiple imputation of the missing values following drop-out based on an...

متن کامل

Multiple Imputation for Missing Data: Concepts and New Development

Multiple imputation provides a useful strategy for dealing with data sets with missing values. Instead of filling in a single value for each missing value, Rubin’s (1987) multiple imputation procedure replaces each missing value with a set of plausible values that represent the uncertainty about the right value to impute. These multiply imputed data sets are then analyzed by using standard proc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011